Theory and Algorithms for the Bandit Problem
https://gyazo.com/478780289217f7353fd0a54f759bf34e
Theory and Algorithms for the Bandit problem - Machine Learning Professional Series.
Junya Honda (Author), Atsuyoshi Nakamura (Author)
Amazon
I heard that Thompson sampling on logistic regression is discussed in chapter 7.
http://nbviewer.jupyter.org/github/hagino3000/notebooks/blob/master/MLP_bandit/Chap7_binary_reward.ipynb
---
This page is auto-translated from /nishio/バンディット問題の理論とアルゴリズム using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thought to non-Japanese readers.